Prediction of F0 parameter of contextualized utterances in dialogue
نویسندگان
چکیده
In order to synthesize natural spoken dialogue, it is necessary to incorporate dialogue information into generation of the surface sentence and the prosody. This paper describes the prediction of F0 maximum for minor phrases in dialogue based on a two-step predictive method. Special attentions are directed to speci c phrases containing the person's name or the day of the week in the schedule arrangement task in order to narrow the diversity of characteristics of F0 parameters in dialogue. Seven features were identi ed as dialogue information which are useful to predict the F0 parameter. Two D-rule sets derived from the person's name or the day of the week are very similar to one another. They reduce the total prediction errors by about 50% for the data which have much in uence of dialogue context.
منابع مشابه
Spontaneous dialogue: some results about the F0 predictions of a pragmatic model of information processing
This paper presents the first results of a semanticpragmatic model which assigns a specific label to the relevant words of dialogue utterances and predicts their F0 value. The originality of this work lies in the kind of utterances the model has been designed for: dialogue utterances. The labels of the model represent the degrees of both the expected/unexpected and known/unknown aspects of the ...
متن کاملAnalysis of Changes in Dialogue Rhythm Due to Dialogue Acts in Task-Oriented Dialogues
We consider that factors such as prosody of systems’ utterances and dialogue rhythm are important to attain a natural human-machine dialogue. However, the relations between dialogue rhythm and speaker’s various states in task-oriented dialogue have been not revealed. In this study, we collected taskoriented dialogues and analyzed the relations between “dialogue structures, kinds of dialogue act...
متن کاملPerception of "tonal focus" in Greek
The present paper reports on the way tonal correlates of focus impact its identification in Greek simple declaratives. The material was based on 4 utterances with different focus placement. Manipulation of the F0 contour and duration of the original utterances resulted in a list of 18 utterances, repeated 10 times each, and presented randomly to each of 10 informants. The informants were asked ...
متن کاملProsodic features associated with the distribution of turns in Finnish informal dialogues
In free dialogue, the speaking time may be distributed among the speakers in various ways. These turn-taking dynamics probably reflect interactional settings. The timing of utterances and pauses has been studied as early as in the 1930s, whereas the long-term prosodic properties of interaction and turn-taking dynamics have received less attention. In this preliminary study, we explore and visu...
متن کاملData-driven emotion conversion in spoken English
This paper describes an emotion conversion system that combines independent parameter transformation techniques to endow a neutral utterance with a desired target emotion. A set of prosody conversion methods have been developed which utilise a small amount of expressive training data ( 15 min) and which have been evaluated for three target emotions: anger, surprise and sadness. The system perfo...
متن کامل